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ABSTRACT 


This paper presents discussion of internal structure of Noun Keywords: noun, pronoun, numeral, 
Phrase (NP) in Urdu. It also proposes a computational non - verbal adjective 

grammar using LFG (Lexical Functional Grammar) 

formalism. 


Introduction 


As in other languages, Noun Phrase (444 ssel, NP) in Urdu also ranges from a single word to 
(theoretically) an infinitely long construction comprising other phrases as its constituents. NP 
may contain a number of word classes and phrases including pronouns, adjectives, nouns and 
quantifiers. It is important to find out the exact rules which govern how NP may be formed in 
Urdu and when these constructions are licensed. This is not only important to understand the 
grammar of Urdu but also essential for developing the computational models of the language. 
Case Phrase (KP1) in Urdu is formed by simply an NP or NP followed by Case Marker, cm). 
KP (and thus NP) forms fundamental argument for other constituent structures, including a 
Sentence (S) and Verb Phrase (VP). The variety in which NP may be constructed in addition to 
its frequent usage in forming or parsing Urdu grammar makes it an essential topic of further 
investigation. 

Various aspects of construction of NP in Urdu have been addressed by many grammar 
books of Urdu. The current work looks at these analyses in conjunction with the requirements 
for Urdu grammar from the corpus of Urdu text to assess and extend the work. Analysis of 





' Since CP is more generally known as abbreviation for other term Complementizer Phrase, hence KP is used here 
as abbreviation for Case Phrase, to avoid confusion. [14] has also used KP to abbreviate Case Phrase. 
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different authors is discussed and eventually a formal set of rules for analyzing NP of Urdu is 
developed. Part of the published material on Urdu grammar is available only in Urdu hence the 
translation of the terminology used in such text is also presented in this paper. However, when 
exact English translation is not found in the literature, then most conceivable translation (as 
apparent from its definition and illustration given in that text) is provided instead of mere 
transliteration, so that the reader may realize the underlying concept and meaning. Lexical 
Functional Grammar (LFG) framework is used to realize these rules. Other formalisms can also 
be used, and the choice of LFG is arbitrary. 

Next section visits the literature; the subsequent section shows the analysis and topic-wise 
segments of the computational grammar presented in this paper. Then, there is a comparative 
analysis of the literature and current work. 


Scope of this paper 


This paper studies NP construction for the purpose of computation grammar for Urdu. The NP 
can contain clauses, other phrases, and elementary items as its constituent. Some constituents 
(phrases and clauses) of NP also have the ability of taking NP as their constituent (relative 
clause, postpositional phrase, and case phrase etc.); while others always appear at a 
subordinating position of NP (e.g. noun, pronoun, numeral, non-verbal adjective). This paper 
focuses on such subordinating items only, because each of other constructions requires its 
separate study and analysis. Therefore, genitive pronoun and verbal adjectives are also not 
included in this paper. Handling of the coordinate conjunction of NP and adjectives is also 
included in the current grammar. 


Literature review 
This section reviews the literature available on elementary constituents of the NP. 


The noun (a) 


Oxford English Dictionary (http://www.oed.com) describes Noun as A word used as the name 
or designation of a person, place, or thing; the class or category of such words. WordNet 
(1998) defines noun semantically as a content word that can be used to refer to a person, place, 
thing, quality, or actionl, or functionally as —the word class that can serve as the subject or 
object of a verb, the object of a preposition, or in apposition.| Nouns are generally divided into 
many sub-classes, which include common vs. proper nouns, and mass vs. countable nouns (e.g. 
see [Jurafsky]). Further sub-classes may also be defined based on syntactic, grammatical or 
semantic roles of different nouns. Urdu grammarians have also sub-classified the nouns, which 
are discussed below. 

Haq M. (1987) divides the nouns in two main classes, proper (v2 vs!) and common ( vs! 
aS). Proper nouns are further divided into four sub-classes, title (444), attributive name (4), 
alias (#48) and nom-de-plume (alas), and common nouns are sub-classified as state (^45), 
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collective (¢ s>), locative (#44) and instrumental (~f) nouns. Haq further divides the locative 
noun into nouns which represent a —locationl in space (U* -244) and time (uea åh), 

Javed (1981) divides the nouns flatly into five types, common (e$), proper (U4), 
collective (s>), abstract (248) and mass (or non-countable, ..» Ux, also referred to as 
material, å+») nouns. Platts T. (1909) does not explicitly discuss the types, but divides nouns 
into five classes based on morphological behavior. He lists abstract nouns, nouns of agency or 
attributes or possession or appellations, nouns of place, diminutive nouns and compounds. 

Siddiqui (1971) presents the most comprehensive classification of nouns. He divides the 
classification on the basis of structure (“4:), nature (=$) and other. The classification on the 
basis of structure (“4:¢) is as follows: 


. Primitive (neither derived nor derivable) Noun (4 ġel), which is sub-classified as proper 


(ua) and common (eS) nouns. The proper nouns are further sub-categorized as title (4:44), 
Appellation or Attributive Name (4), sailA (258), nom-de-plume (04154) and kinship (4#5¢S). 
The common nouns are sub-categorized as state (4S) and collective (¢ s>) nouns. 


b. Verbal Noun (W348 oel), which is the infinitive and gerund form of verb, and generally used as 


Cc. 


noun. The derivational noun below is derived from verbal noun. 

Derivational Noun (448 ġel), which is sub-classified into subject (84), object (JS), present 
(~H), deverbal (UAbs Jb), wage (43%, name of the payment derived from the name of the 
work for which the payment is due), locative (#44, includes locative in —timel ux -e4band 
in —spacel us åh), and instrumental (~) nouns. 

The classification of nouns according to nature ($) is as substantive (aba), quality 
(444) nouns and pronouns ($ Š). Finally, Siddiqui lists other classes of nouns as sound (cub), 
indefinite (+$), relative (Jhb), interrogative (ticle), numeral (248), comparative (J) and 
exaggerative (+h) nouns. (Schmidt, L. 1999) and (Platts, 1909) have not carried out 
hierarchical categorization of noun in this way, they have studied the phenomenon of their 
morphological construction. 

Siddiqui (1971) has shown some categories of noun without embedding them into the 
hierarchy of main types, viz. sound noun (cb oe!, names for sounds of animals or machines 
etc.), indefinite pronoun (as uel, the translation is referred from (Platts, 1909) on the basis of 
description and illustrations of indefinite pronouns (translated by Platts as USS 2 se!) given 
therein, includes words: (any) and @S(some)), relative pronoun (Jha òl), interrogative 
pronoun (ee! gel), cardinal (48 oe), comparative adjective (U4 vel), intensity adjective ( ve! 
~la), and demonstrative pronoun (+! oel). Cardinal are discussed in numeral phrase; 
indefinite pronoun, interrogative pronoun, and demonstrative pronoun are discussed in 
pronouns; and comparative adjective and intensity adjective are discussed under adjective 
phrase. 

As can be seen, different authors have done different classifications. These classifications 
are based on morpho-phonological, syntactic and semantic behavior of nouns. 

Proper noun and common noun are the most common set (or subset) of types (noted in all 
the referenced texts). Urdu has no indication in the script to identify the proper noun (as 
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opposed to the capitalization in English), but the grammatical distinction exists. Proper noun 
does not take plural declension while common noun does, in general. There are special uses of 
proper noun when it is not behaving typically: 


(Haq, 1987) -a Jä GS AV 4) 524 4A—That person is Haatam of his days.| 

(Haq, 1987y- Jci" 4—He is Rustam of Hind.| 

(Javed1981) -= aoi JS ¢l—These days Haatams are not bornel 

Haq (1987) and Siddiqui (1971) have also noted this exceptional usage of proper noun. 

In the above mentioned usage, proper noun is used to refer to some property, state, or feature 
in the context, therefore, Haq (1987) has called it adjective. Javed, (1981) has not given it any 
specific name however he writes that whenever a proper noun is used in metaphoric behavior, it 
starts accepting plural as does the common noun. Siddiqui (1971) has called such an instance of 
proper noun as adjective (because it is signifying some attribute of the entity owning this proper 
noun), for its being the sub-type of generalized noun (45S oel, a proper noun made common 
through AŠ, Arabic word for generalization, so it can be termed generalized noun, 
henceforth), justifies it by distinguishing it from regular common noun. 

The Infinitive form of Verb in Urdu mostly behaves as noun entry (and sometimes as a 
verb to show the aspect of necessity. Platts (1909) has used both labels (Infinitive and Verbal 
Noun) to refer to this type. Platts (1909) has also mentioned that Gerund in Urdu also has the 
same shape (as of Infinitive) and behaves as Verbal Noun. 

Javed (1981) has stated Non-Count Noun and its synonym å% oel, which may be 
translated as Material Noun or Mass Noun, to be the same concept. Quirk & Randolph, et el. 
(1985) have defined Material Noun as something that is used as an ingredient of other products 
(made of/from the former), for example: 


“This cake is made with lot of [eggs]. A bracelet of [solid gold]. A man of [steel]. ” 

The first example indicating —eggs as Material Noun is apparently contradicting with the 
definition of Non-Count Noun, but the other side of the picture is that the example of US 
(wheat) as Non-Count Noun seems apparently uncountable because of the size of each grain in 
proportion to their number, which somehow justifies the —eggs especially in a bulk quantity (or 
may be when the number is not known). Javed (1981) is the only grammar writer who has noted 
this kind of the Noun. An important property of this type of noun is that they do not accept 
Numeral Adjective; rather they are measured in quantity so they accept Quantifying Adjective 
(a.k.a. Quantifier). Nevertheless, this kind is very important and useful to separate from other 
nouns especially when focusing on computational aspects for the sake of agreement with 
context (in terms of selection of one of the two, Numeral and Quantifier). 

However, it is also observed that — — ly 4I (two cups of tea) is shortened to ——» 4 
(two teas). Here the measurement unit is presumed to be — sul, because of the nature of 
substance in question, and the mindset of the participants of conversation. In such uses of 
Numerals (with Non-Count Noun) context is very important, for example —J% îl (two rice) 
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may refer to dis & 3 (two plates of rice) or —Ji= —bs 3 (two bowls of rice) depending on the 
measurement unit most frequently used between the participants of conversation. 


The pronoun (sa awl) 


Platts (1909) has divided pronoun into demonstrative pronoun (“238 44), personal pronoun 
(bai oS), relative pronoun (tbs is), correlative pronoun (Jà oel 34), interrogative 
pronoun (asic! i Š), indefinite pronoun (ASÀ JS), reflexive pronoun (Lsa Ui Š), reciprocal 
pronoun (si U&s§), and pronominal adjectives (4:58 44), Only Platts (1909) has noted the 
pronoun type correlative, however, personal pronoun is used instead of correlative pronoun in 
modern Urdu. (Platts, 1909) 

Javed (1981) has classified pronouns into personal (with subclass honorific), reflexive, 
demonstrative, genitive (), common genitive, indefinite, interrogative, relative, adverbial ( Al‘ 
ŠŠ IS with subclasses, locative, temporal, and manner). Haq (1987) has categorized 
pronouns into personal, relative, interrogative, demonstrative, and indefinite. 

Siddiqui (1971) has noted personal pronoun, relative pronoun, interrogative pronoun, 
demonstrative pronoun, indefinite pronoun, and pronominal adjectives to be the types of 
pronoun. Siddiqui (1971) has divided the pronoun into demonstrative pronoun, personal 
pronoun, relative pronoun, interrogative pronoun, and indefinite pronoun. Siddiqui (1971) has 
shown honorific pronoun (c #454 $Š) and reflexive pronoun to the sub-type of personal 
pronoun. Siddiqui (1971) has also noted that the word @(who) is the only relative pronoun in 
Urdu, and has other forms like —>> (to whom) is the oblique form. 

Schmidt (1999) has shown demonstrative, personal, reflexive, interrogative, indefinite, 
relative, and repeated pronouns. The last type seems a morphological phenomenon of 
reduplication. However, the reduplications illustrated in the text are lexicalized (not generative). 


Phrasal items 


This section reviews the literature available on phrasal constituents of the NP. Since, the 
genitive phrase (thus the genitive pronoun) is considered to be the KP hence, excluded from the 
discussion of NP. 


The adjectives (sua aw) 


Haq (1987) has divided adjectives into personal adjective (ï) “44), associative adjective 
(c> ih), cardinal and pronominal adjective (48 46), quantifier (=la ib), and 
demonstrative pronoun (4 Š Gib). 

Javed (1981) has categorized adjectives as personal adjective, cardinal and pronominal, 
quantifier, intensity adjective (s4245 <4b), demonstrative pronoun ($ Š áb), and comparative 
adjective (~~=). 

Siddiqui (1971) has classified adjectives as personal adjective (~>. 44), associative 
adjective, cardinal and pronominal adjective (also includes fraction (.4#S 248) as its subclass), 
and ordinal (42238 44), 
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Schmidt (1999) has shown interrogative pronoun, indefinite pronoun, adverbial pronoun 
(near, far, interrogative, and relative), repeated adjective, and comparative adjective to be the 
types of adjectives. Schmidt (1999) has also noted possessive adjective, reflexive possessive 
adjective, numbers, and participles used as adjective. Platts (1909) has studied morphological 
behavior of Urdu adjectives. Javed (1981) has shown that the adjective usually precedes the 
noun but for emphasis it may follow. In the following example the adjective is used at the end 
of the sentence: 


isi s Ae GS Ja 5a(Jamil was tired when reached home.) 
“LES Gal Sad 98 8 Gast SO! 2 ua (I saw a cave — scary, dark, and obscure) 


The difference of order in the above examples is to emphasize on the specific attribute of the 
substantive. 

Platts (1909) has noted the use of genitive construction (using genitive diacritic Use 
<aS!), according to the rules of borrowed Persian Morphology (hence generally used with 
borrowed Persian words), where Adjective comes after the Head Noun in the Noun Phrase, e.g. 
Sy 244(a good man, literally —a man of goodl). However, this construction is not frequently 
used in contemporary Urdu. 


The numerals 


Platts (1909) has listed numerals apart from adjectives, and has subdivided them into cardinal, 
ordinal, collective numeral, distributive and multipli-cative, numeral adverb, fractional number, 
currency (rakam). 

Schmidt (1999) has also listed numerals apart from adjectives, and has categorized them as 
cardinal numbers, ordinal numbers, fractions, frequency (once, twice, etc.), multiplicative, 
repeated num-bers, and numbers with oblique plurals. 

Haq (1987), Javed (1981) and Siddiqui (1971) have noted numerals as type of adjectives, 
hence, listed in the adjective phrase below. 


Features and agreement 


The term ce! ~=Y(Essentials of Noun) is used by Haq (1987) to discuss the features of the 
constituents of the NP, and their agreement requirement to form a grammatically valid NP. 
Every noun (and some other constituents of NP, e.g. adjective) in Urdu has the Gender (44) 
feature, which needs to agree with the head of the NP being formed. When a constituent (e.g. 
whole number) does not have the gender restriction for attachment, it is said to have neutral 
gender which means it has set of all possible values for that feature in Urdu, i.e., masculine and 
feminine, as the value for its gender feature. 

Productive Morphotactics have been observed in Urdu for the identification of the gender 
and number of a noun from its shape, but in reality, it is a lexical phenomenon, e.g. S5!(boy, 
example of marked masculine singular noun; the mark is ending vowel) vs. Sprayer, 
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example of a noun having same mark but it is feminine singular), and  S5\(girl, example of a 
marked feminine noun; the mark is ending vowel) vs. (elephant, example of a noun having 
same mark but it is masculine singular). 

The feature of honor, in Urdu, overrides the feature of number: a singular starts requiring 
plural agreement due to increased honor, e.g. =f s4¢l(The teacher came) vs. — 4 je|(The 
teachers came) vs. | Cub asic |(Respected teacher came). It also shows that the agreement is 
not restricted to NP and its constituents only, rather is witnessed across other phrases of the 
sentence, as well. 

Butt (2011) has noted that explicit case (>h) marking is useful for the establishment of the 
semantic roles of nouns (and pronouns) and their syntactic relationship to the verb. The 
freedom in phrase order in Urdu language is due to explicit case marking. For example: 


KJA aa  S5lgives the same meaning as 
Sale [ SiS. 22 (The girl looked at door) 
In the above example, —z marks that — SII is the agent (or the Subject, doing the act of 
seeing) and —S| marks the preceding noun —¢-.] to be the patient (or the Object, the thing 
being seen). So the case is a tool for marking relationships between dependents and the head. 
(Butt, 2011) 
Shape (form) of the gender marked noun also changes on the basis of attachment of Case 
Marker Schmidt (1999): 
(a) Nominative (Š) form: Only for Nominative Case 
(b) Vocative (3/4) form: Only for Vocative Case. This form does not exist for Verbal Nouns. (Platts, 
1909) 
(c) Oblique (2) form: For all other Cases (requiring separate word, case marker, to follow) 


Usually a countable noun allows plural inflection and can follow a cardinal or 
pronominal adjective, whereas a noncount noun allows pronominal attachment of quantifier. 
This constraint of attachment cannot be implemented without having a feature to indicate 
whether the noun is a count or non-count. 


Proposed grammar with analysis 


The proposed grammar is written using the notation of Lexical Functional Grammar. It is 
incremented with discussion on every addition. 


The Noun 


The computational analysis of the Noun portion of the Substantive Phrase (NP consisting 
substantive noun only) has shown that theoretically infinite substantive nouns can occur in an 
NP. 
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SubstantivePhrase + 
[SubstantiveNoun ]* 


SubstantiveNoun 


The Substantive Noun is a POS (Part of Speech) that means Substantive Noun, this term is used 
after (Siddiqui, 1971) and (Platts, 1909). This word is more precise and specific than the term 
Noun that has broader sense. A substantive noun refers to the generic concept that includes 
common noun and proper noun etc. Adding the features to this grammar rule gives: 


Substant ivePhr as e> 
(Substantivenoun] * 


SubstantiveNoun: ?Gender=VGender , 
*Number=tNumber, Case=dCase, 
SCount=/Count, Honor=+Honor ; 


Since the unification works for the agreement of feature values of the current item with its 
composing parent hence only last noun in the string of nouns is coded for agreement, 
considering the last noun to be the head of the NP, most of the times, in Urdu. 


Pronoun 


As the genitive phrase is not in the scope of this paper therefore it is not modeled herein. 


‘bili = 

(Demonstrative: TGender=\cender, 
{Number =/-Number, SCase=1Case, 
tcount=dcount, THonor= Honor; ) 
SubstantivePhrase: Gender=/Gender, 


tNumber=/Number, TCase=+Case, 
tCount=lcount, THonor=tHonor ; 


Pronoun: ?Gender=\Gender , 
+t Number=)Number, tCase=lCase, 
tCount=Lcount, tHonor= Honor; 


In the above rule, Demonstrative refers to the demonstrative pronoun, and parentheses 
around this POS show its being optional. The Substantive Phrase, so far, is same as coded 
above. Pronoun is the POS assigned to all pronoun types except the demonstrative pronoun 
and genitive pronoun. It is important to notice that the Pronoun construction is prevented 
from Demonstrative attachment. 


Adjective phrase (Liras ms) 


The Adjective Phrase may contain multiple adjectival items. 
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AdjectivalPhrase > 

[Adjectival: ?Gender= Gender , 
*Number=dNumber, *Case=VCase, 
*Count=VCount, tHonor=\Honor ;]* 
Adjectival: ?Gender=\Gender , 
*Nunber=dNumber, Case=vCase, 
*Count=/Count, tHonor=+Honor : 

















Substantive Phrase must be modified to incorporate the Adjectival Phrase (Adjective 
Phrase): 


SubstantivePhrase > 


AdjectivalPhrase: SGender=JGender, 
*Number=\Number, tCase=dCase, 
*Count=lCount, Honor=+Honor: 


Takstein 
SubstantiveNoun: *Gender=\Gender , 
*Number=/Number, TCase=dCase, 
*Count= Count, Honor=+Honor: 





The Adjectival Phrase is kept optional. The subsections in this section analyze and build the 
grammar for incorporation of the constituents of adjective phrase. 


The numeral phrase 


General numerals are complex and of recursive nature so it is better to devise a 
component/tool that assists the grammar application in this regard. For example: 


$45 — Us oba Éy CSY gulié «st(three hundred and fifteen lakh sixty thousand four 
hundred and thirteen) 


Quantifier 


Quantifiers occur before typical adjectives. 
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ic dini | 


CQuantifier: tGender=\Gender, 
*Number= Number, (Case=+Case, 
*Count=+Count, TtHonor=dHonor 3 

AdjectivalPhrase: *Gender=\Gender, 
*Number= Number, (Case=+Case, 
Honors Honor : 


Cardinal 


Cardinal is the whole number, and may introduce extra complexity to the grammar if 
generated as ordinary production rule. So it was analyzed that to keep the grammar relevant 
to the grammatical aspects. This idea entails that there exists a small program invoked from 
the grammar to generate or validate the complicated formation of the (theoretically) 


infinitely long whole number, referred to as Composite Cardinal here. 


Numera Phased 


| 


CompositeCardinal: tGender=4Gender, 


ANunber=YNunher, ‘*Case=VCase, 
ACount=4Count, tHonor=)Honar : 


Thus the Adjectival Phrase will become: 
igi acacia | 


(NumeralPhrase: ?Gender=/Gender , 
*Number=JNumber, tCase=4Case, 
*Count=+Count, THonor=+Honer;) 

[Substantivenoun] * 

SubstantiveNoun: ?Gender=/Gender , 
4*Number=/Number, tCase=Case, 
*Count= Count, Honors Honor; 


In the above rule, Substantive Phrase is extended (the ellipses are just shown here to 
indicate that previous rule is extended, not replaced) to incorporate Numeral Phrase is kept 
optional, when it occurs it contains nothing but a Composite Cardinal, till this stage. Feature 
agreement/unification ensures that value of the cardinal must agree with the number feature 
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of the head of the Substantive Phrase (which is the parent production of the Numeral 
Phrase). The Composite Cardinal constructs or validates the construction of the number 
which is a valid string of Cardinals where Cardinal is the POS assigned to such words. 


Ordinals 


Occurrence pattern of ordinal is same as that of demonstrative pronoun, so the Noun Phrase 
must incorporate it at par with Demonstrative: 
NounPhrase + 
£ 
[ 


Demonstrative: TGender=+cGender, 
TAumber=-Numnber, case=lcase, 
Toount=ecount, THonor=-l-Honor ; 
l 
[ 
ordinal: TGender=lGender, 


ThNumber=--Number, Tcase=)case, 
TCount=-count, PHonor=-Honor; 


J 

SubstantivePhrase: ?tGender=/Gender, 
TNumber=Number, TCase=+Case, 
~tCount=-Count, tHonor=+Honer ; 


This is changed Noun Phrase rule (not added). Here Ordinal is the POS for ordinal words in 
the lexicon. It must be noticed that whole construction before the Substantive Phrase in this 
rule is optional. 


Multiplicative 


Multiplicative also cause similar extension the grammar so far. 


NumeralPhrase> | 


| 
Multiplicative: Gender =4Gender , 


Sunber=WNunber, *Case=VCase, 
fCount=VCount, tHonor=)Honor: 


| 
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Multiplicative is the POS for such words in the lexicon; and the ellipses are just shown here 
to indicate that previous rule is extended, not replaced. 


Fractions 


Urdu has two types of fractions, viz. Suffix Fractions (a Cardinal may precede, = oï 
(three-fourth)), and Prefix Fractions (a Unit Cardinal always follows, b+ 5S(one and a 
half thousand)). 


NumeralPhrase> ee 


(Cardinal) 

FractionalSuffix: ?tGender=+Gender, 
TNumber=/Number, Case=+Case, 
TCount=LCount, THonor= Honor; 


(Fractional Prefix) 

Unitcardinal: tGender=SGender , 
tNumber=+Number, tCase=/Case, 
TCount=+Count, t Honor =FHoner ; 


Cardinal, Fractional Suffix, Fractional Prefix, and Unit Cardinal are the POS for the words 
of these categories; and the ellipses are just shown here to indicate that previous rule is 
extended, not replaced. 


POS modifying particles 


There a few morphemes in Urdu that are handled syntactically in this work because of their 
orthographic separation from previous token. 


vaalaa (%19) Construction 


The word¥!? (say, vaalaa-morpheme) has an importance while working on computational 
grammar of Urdu; it adds complexity to the analysis of NP. There are some suffixes shown 
in Siddiqui (1971) that can be used to make noun from an existing verb, of which vaalaa- 
morpheme is the most open to be used as suffix for this purpose. In fact, it forms Adjective 
Phrase, and occurs before the head noun of the Noun Phrase. Just like any other adjective, 
vaalaa-construction, also becomes a noun when the head noun is absent in the NP, for 
example: „>È A JXx(The mad man spoke.) is same as >È J (The mad man spoke.); here 





? The term, vaalaa-morpheme, introduced here, serves for both genders and both numbers in Urdu. However, 
surface form is Masc.Sg:¥!, Fem.Sg: A, Masc.PI: AV’, and Fem.Pl: utl. 
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dropping of the head of the NP causes the pertinent Adjective Phrase to be promoted to NP. 
Similarly, 4426 ss YS (The milk-man spoke.) is same as >8 YI°S(The milk-man spoke.); 
here, again, dropping of the head is inducing promotion of the Adjective Phrase to NP. 
However, the trend of dropping the head noun in vaalaa construction is frequent. 

The noun type, Generalized Noun to be the sub-type of Adjective, stated by Siddiqui 
(1971), gives the linguistic evidence of this phenomenon. Flexibility of vaalaa-morpheme to 
allow any complex construction be preceded, adds complication in its computational 
analysis. It can be used with verbs and nouns. It can be arbitrarily long and complex as: 

ue S31 Nites SW A Bis WARS IV aad Wes Me US fab 
(The handkerchief belonging to the girl in the picture with the frame whose color was same 
as the color of the book belonging to the man who was talked about in the morning 
conversation) 

This construction seems ridiculous but is valid, grammatically. Therefore, keeping such 
constructions generative via grammar seems logical. 

Thus next grammar rule is accommodating the vaalaa-construction. For the purpose of 
this rule, a new Part of Speech —~i4b-‘|| (vaalaa-morpheme) is needed, which has only 
one main entry —‘!l, in all its forms (on the bases of gender, number, and case of the 
following word). 


vaalaaPhrase > NounPhrase 
vaalaa-morpheme: Gender=\Gender , 
‘Number=-Number, ?Case=1Case, 
*Count= Count, *Honor= Honor ; 

SubstantivePhrase > van 
[vaalaPhrase]* 


SubstantiveNoun: ?Gender=4Gender , 


*Number=/Number,  Case=/Case, 
*Count=tCount, Honor=JHonor; 


Numeral particles 
There are some numeral morphemes that change the grammatical behavior of main word. 


These are referred to as Numeral Particles collectively. The ellipses are just shown here to 
indicate that previous rule is extended, not replaced. 
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NumeralPhrases fan 


CompositeCardinal — 
MultiplicativePartical 
] 


NumeralPhr ase> aa 4 


Compos7teCar dinal 
FractionalPartical 


CompositeCardinal 


] 


Conclusion and future work 


The literature review from native and foreign grammar writers gave a detailed 
understanding about the internal structure of the Noun Phrase in Urdu. It is a complex 
structure; still the work is in progress. The following items are yet to be included in this 
grammar: Intensity sub-phrase under the Adjectives, More about vaalaa Construction, Units 
and Rates, Combination of Numerals with Adjectives, Combination of Numerals with 
Nominal, Coordinate Conjunction of NP ( —S ôe 3 ul 233 Aet 45), and Coordinate 
Conjunction of Adjectival Phrase (2 384 GS4 3 al «JS Gaus «tid od 28), 
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